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Abstract. The Mock LISA Data Challenges are a programme to demonstrate 
and encourage the development of LISA data-analysis capabilities, tools and 
techniques. At the time of this workshop, three rounds of challenges had been 
completed, and the next was about to start. In this article we provide a critical 
analysis of entries to the latest completed round, Challenge IB. The entries 
confirm the consolidation of a range of data-analysis techniques for Galactic and 
massive— black-hole binaries, and they include the first convincing examples of 
detection and parameter estimation of extreme-mass-ratio inspiral sources. In this 
article we also introduce the next round, Challenge 3. Its data sets feature more 
realistic waveform models (e.g., Galactic binaries may now chirp, and massive- 
black- hole binaries may precess due to spin interactions), as well as new source 
classes (bursts from cosmic strings, isotropic stochastic backgrounds) and more 
complicated nonsymmetric instrument noise. 

PACS numbers: 04.80.Nn, 95.55.Ym 

1. Introduction 

The Laser Interferometer Space Antenna (LISA), an ESA-NASA mission to survey 
the gravitational- wave (GW) sky at frequencies between 10~ 5 and 10 _1 Hz, will record 
gravitational radiation from millions of sources, most of them in our Galaxy, but many 
populating the low-to-high-redshift Universe pQ. Such a variety of signals, overlapping 
in both the time and frequency domains (to the point of creating confusion noise at 
some frequencies) poses a number of interesting new challenges for GW data analysis, 
whose solution is essential if we are to draw the greatest possible science payoff from 
such a bold and innovative observatory. 

At the end of 2005, the LISA International Science Team (LIST) initiated a 
programme of Mock LISA Data Challenges (MLDCs) with the goal of understanding 
at the conceptual and quantitative level the peculiarities of LISA data analysis, of 
demonstrating LISA's observational capabilities, and of kickstarting the development 
of data-analysis algorithms, pipelines, and infrastructural elements. An MLDC Task 
Force, chartered by the LIST, periodically issues challenge data sets containing GW 
signals from sources of undisclosed parameters, embedded in synthetic LISA noise; 
challenge participants have a few months to analyze the data and submit detection 
candidates, which are then compared with the sources originally injected in the data 
sets. ( Training data sets with public source parameters are also provided to help 
participants tune and troubleshoot their codes.) 

Three rounds of MLDCs had been completed at the time of this workshop, each 
spanning approximately six months. Challenge 1 [21 [5] was focused on establishing 
basic techniques to observe GWs from compact Galactic binaries, intrinsically 
monochromatic, isolated or moderately interfering; as well as from the inspiral phase 
of bright, isolated, nonspinning massive-black- hole (MBH) binaries. Challenge 2 
[H [5] featured three considerably more complex data-analysis problems: a data 
set containing GW signals from approximately 26 million Galactic binaries (again 
monochromatic) drawn from a randomized population-synthesis catalog; a data set 
(the "whole enchilada") with a similar Galactic-binary population, plus GW signals 
from an unknown number (between 4 and 6) of nonspinning-MBH binary and from 
five extreme-mass-ratio inspirals (EMRI); and five more data sets with single-EMRI 
signals. 

The very steep increase in complexity introduced over a short time-scale with 
Challenge 2 and the need to consolidate analysis techniques (especially so for EMRIs) 
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before moving to even more taxing challenges motivated the organization of Challenge 
IB, a repeat of Challenge 1 with the addition of single-EMRI data sets. Challenge- 
IB data sets were distributed in the late summer 2007, with a deadline of December 
2007 for entries. Ten collaborations submitted solutions. Highlights from this round 
include the range of techniques used, the participation of a number of new groups that 
successfully recovered signals from galactic binaries and MBH binaries, and the first 
convincing demonstration of EMRI detection and parameter estimation. Section [2] 
provides a brief summary of the entries; additional details about the work of individual 
collaborations are given elsewhere in this volume. 

As we write (April 2008), Challengc-3 data sets have just been released, with 
entries due at the beginning of December 2008. Challenge 3 represents a definite step 
in the direction of more realistic source models (such as chirping Galactic binaries 
and spinning-MBH binaries) and of new source classes (such as short-lived bursts and 
stochastic backgrounds). Section [3] describes the Challenge-3 data sets and waveform 
models in detail. 

2. Report on Challenge IB 

Challenge IB focused on three classes of GW sources, each tackled in a separate 
subchallenge: monochromatic Galactic binaries, MBH binaries and EMRIs. The 
Galactic-binary data sets (lB.l.la-c and IB. 1.2-5) and the MBH-binary data sets 
(IB. 2. 1-2) had a duration of approximately one year (31457280 s, sampled at intervals 
of 15 s), while the EMRI data sets (IB. 3. 1-5) were twice as long (with the same 
sampling time). The challenge solutions submitted by the participants were assessed 
with the simple criteria adopted in previous rounds [21 [5]. Detector-response data were 
generated with the best-fit source parameters A su b submitted by the participants, using 
the same code previously employed to build the challenge data sets. These data were 
then compared to the detector response to the true waveforms, using as a figure of 
merit the recovered SNR 



where A and E denote time series for the noise-orthogonal TDI observables (2X — 
Y — Z)/3 and (Z — Y)/y/3 [6] and (-|-) denotes the usual signal product weighted 
by instrument noise. We also quote the correlation C — SNR/SNR op t, where 

SNRopt = \/(-Atruc|-^true) + (-Etrue l^true) is the optimal SNR. For a perfect detection 
C = 1, but fluctuations ~ l/SNRo pt are expected because of instrument noise. When 
we examine parameter errors, these are defined simply as AA l = A* ub — A| me ; in 
some cases it makes sense to consider the fractional parameter errors A A 4 /A 1 = 
(-^sub — ^true)Mtruc- I n the rest of this section we briefly discuss the entries submitted 
by participants; the technical notes accompanying the entries can be found at 
www .tapir . caltech. edu/~mldc/results IB/results .html 

2.1. Galactic binaries: Challenges 1B.1.X 

Data sets lB.l.la-c and IB. 1.2-5 contained GW signals from monochromatic Galactic 
binaries, in a variety of parameter ranges and source combinations. Seven parameters 
are required to fully characterize each such source: the amplitude A, the (constant) 
frequency /, the ecliptic latitude and longitude j3 and A, the inclination and 



SNR(A sub ) 



(Atrucl-Asub) + (E t Tue\E su b) 



(1) 



\A^sub|^4sub) + (-E'subl-E'sub) 
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Figure 1. Sky positions reported by participants in Challenge IB. 1.1a, compared 
to the true location of the source. All the sky positions fall within the Fisher- 
matrix 1-cr contour, shown as a dotted line. (The spread of the reported sky 
positions is due to both the difference in noise realizations between the Synthetic 
LISA and LISA Simulator versions of data set IB. 1.1a, and the systematic errors 
of the searches.) 



polarization angles l and ip, and the initial phase <f>o- Entries were submitted by 
five groups: GSFC (scientists at Goddard Space Flight Center); IMPAN (the 
Institute of Mathematics of the Polish Academy of Science and the Institute of 
Theoretical Physics at the University of Wroclaw); AEI (the Albert Einstein Institute 
in Golm, Germany); MCMNJU (the Institute of Applied Mathematics of the Chinese 
Academy of Sciences and the Department of Astronomy of Nanjing University); 
UIBBham (the University of the Balearic Islands and the University of Birmingham). 
However, all groups except AEI concentrated only on a subset of the challenges. 

Participants employed a fair range of techniques, in the same broad class as 
adopted for similar challenges in the past [31 13]; however, new implementations and 
different technical solutions are being pursued. GSFC used the X-Ray Spectral 
Fitting Package (XSPEC) [7] to fit templates to energy spectra. The package includes 
a Levenberg-Marquardt optimization algorithm, which was used to obtain an initial 
guess for the source parameters. A Markov-Chain Monte Carlo (MCMC) routine, 
also available in XSPEC, was then used to converge to the best-fit source parameters. 
IMPAN set up a grid-based matched-filtering search with an optimized placement 
of templates on a hypercubic lattice. The JF-statistic [5] was used to reduce the 
search space from 7 to 3 parameters. A similar technique was adopted by AEI [9], 
in conjunction with a rigid-adiabatic model of detector response. MCMNJU used a 
genetic algorithm that optimized the JF-statistic; UIBBham implemented a MCMC 
search described in more detail in [TU] . 

Each of data sets lB.l.la-c contained a GW signal from a single monochromatic 
binary (differing by frequency), with SNR ss 13-25. Table [T] lists the correlations 
recovered by each collaboration. Some of the entries included close matches for the 
intrinsic parameters (/, 9, and <f>), but not for the remaining (extrinsic) parameters: 
this was due to the (relative) inaccuracy of the LISA response models used by the 
participants, or by mismatch between their definitions of the extrinsic parameters and 
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Table 1. Correlations for single-Galactic-binary challenges lB.l.la-c. Asterisks 
denote entries corrected by maximizing the Jf-statistic and using the resulting 
extrinsic parameters; these corrections are not reported where the frequency is 
well off, and the jF-statistic is merely fitting noise. 



Group 


IB. 1.1a 


IB. 1.1b 


IB. 1.1c 




SNRopt 


= 13.819 


SNRopt 


= 24.629 


SNRopt 


= 15.237 


AEI 


0.108 - 


-> 0.984* 


0.922 - 


-> 0.996* 


-0.190 


-> 0.989* 


GSFC 


0.992 




0.807 - 


-» 0.814* 


-0.138 




IMPAN 


0.988 




0.981 - 


-> 0.997* 


0.924 


-> 0.946* 


MCMNJU 


0.952 - 


-> 0.996* 


0.906 - 


-> 0.994* 


0.033 




UIBBham 


0.992 




0.996 









Table 2. Parameter errors for Challenges IB. 1.1a— c. All angles are expressed in 
radians. 



Group 


A/3 


AA 


A/ [nHz] 


Aif> 


At 


Atp 


AA [10^ 23 ] 






Challenge 


IB. 1.1a (/ tn 


,„ = 1.060 


mHz) 






AEI 


-0.032 


-0.120 


-2.43 


0.217 


-0.454 


1.17 


1.22 


GSFC 


-0.004 


-0.071 


-1.81 


0.708 


0.252 


1.33 


1.20 


IMPAN 


-0.031 


0.018 


2.13 


0.454 


0.212 


-1.06 


1.25 


MCMNJU 


-0.017 


-0.042 


-0.53 


0.662 


0.426 


-1.57 


2.34 


UIBBham 


0.005 


-0.079 


-1.51 


0.708 


0.173 


-1.32 


0.65 






Challenge 


IB. 1.1b (/ trl 


lc = 2.904 


mHz) 






AEI 


-0.056 


-0.0090 


0.95 


-1.050 


0.283 


1.63 


-0.066 


GSFC 


-0.462 


0.0606 


-30.9 


2.560 


0.182 


0.52 


-0.024 


IMPAN 


0.020 


0.0007 


0.85 


0.333 


0.339 


-0.60 


0.713 


MCMNJU 


-0.067 


-0.0063 


2.07 


-0.732 


-0.064 


0.84 


-0.223 


UIBBham 


-0.044 


-0.0082 


1.78 


-0.636 


0.043 


1.13 


-0.029 






Challenge 


IB. 1.1c (/ trl 


, e = 9.943 


mHz) 






AEI 


-0.026 


0.0053 


1.84 


-0.499 


-1.120 


3.02 


0.124 


GSFC 


-0.452 


-1.48 


140 


1.820 


-0.471 


-0.66 


-0.695 


IMPAN 


-0.016 


0.0248 


3.72 


-1.510 


-0.197 


2.68 


0.478 


MCMNJU 


-0.555 


-0.3680 


359 


-1.590 


-0.250 


-0.94 


-0.532 



the MLDC's. In these cases we recomputed C by maximizing the JF-statistic [5] for the 
intrinsic parameters provided; the resulting Cs are denoted by asterisks, and show that 
the intrinsic parameters were indeed recovered well enough to ensure solid detections. 
Table [5] lists parameter errors, and figure [1] shows where in the sky each collaboration 
placed the single binary of data set IB. 1.1a, compared to its true position, and to the 
Fisher-matrix 1-a error contour. 

Data set IB. 1.2 contained GW signals from 25 "verification" binaries of known 
(i.e., disclosed) frequency and sky location. Five of them were taken from the list 
of observed binaries on Gijs Nelemans' wiki (llj , while the remaining 20 were placed 
randomly in the Galaxy, varying their frequencies over a representative range. Table 
[3] lists the global correlations (computed for the combined signals of all reported and 
true binaries) recovered by the three groups that participated in this challenge. Just 
as it happened for Challenges lB.l.la-c, problems in assigning extrinsic parameters 
reduced the correlations. (Since the intrinsic parameters were provided to the 
participants, we did not perform ^-statistic-based adjustments, which would amount 
to solving the entire problem. As a result, the low C values of table [3] may be 
symptomatic only of extrinsic-parameter systematics.) 
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Table 3. Correlations, number of recovered sources, and number of false positives 
for Challenges IB. 1.2, IB. 1.4, IB. 1.5. Correlations computed after correcting 
extrinsic parameters using the J-'-statistic are denoted with asterisks. 



Group C 


# recovered 


IB. 1.2 (SNRopt = 


634.918, 25 sources) 


AEI -0.822 


25 


GSFC 0.006 


25 


MCMNJU 0.267 


25 


IB. 1.4 (SNRopt = 


340.233, 51 sources) 


AEI 0.774 -> 


0.966* 13 (2 false pos.) 


GSFC 0.003 -» 


0.282* 6 (1 false pos.) 


IB. 1.5 (SNRopt = 


273.206, 44 sources) 


AEI 0.453 -> 


0.929* 3 



Data set IB. 1.3 contained GW signals from 20 unknown binaries distributed 
across the LISA band, well separated in frequencies. Unfortunately, a bug in the 
random generation of source parameters caused all SNRs to be too small for detection 
(all were below 1). Happily, no participating group reported a positive detection, 
consistent with the correct behavior expected of search algorithms. 

Challenge IB. 1.4 was meant to test search algorithms in the presence of mild 
source confusion. Fifty-one sources were spread across a band of 15 /iHz beginning 
at 3mHz, with an average density of 0.108 sources per frequency bin. By contrast, 
Challenge IB. 1.5 tested algorithms in the presence of a higher level of source confusion, 
comparable to that expected from our Galaxy. Forty-four sources were spread across 
a band of 3 /xHz centered at 3 mHz, with an average density of 0.465 sources per 
frequency bin. Table[3]lists the global correlations and the number of sources recovered 
by the participating groups, as well as the number of false positives, defined here as 
reported sources farther than one frequency bin (1/year) from any true source, or with 
^-"-statistic-adjusted correlation less than 0.7 with all true sources within a frequency 
bin. The top panel of figure [2] shows the combined A and E spectral amplitude for the 
GW signals in data set IB. 1.4, together with the residual after subtracting the signal 
model submitted by AEI. The bottom panel shows the same subtraction after extrinsic 
parameters have been recomputed for this entry by maximizing the J-"-statistic. 

Altogether, it must be said that Challenge 2 provided a more forceful 
demonstration of LISA's science objectives for Galactic binaries [5]; but this round of 
challenges was still very useful for new groups to start implementing search methods, 
and for established groups to continue tuning them. The extrinsic-parameter reporting 
errors seen here are easy to commit, because these parameters are very sensitive 
to the modeling of the LISA response, and because their definitions are somewhat 
conventional, but these errors have little bearing on detection confidence. To avoid 
such problems in the future, we plan to provide a web tool to check the recovered SNR 
against the challenge data sets using the fiducial MLDC waveform-generation code. 

2.2. Massive Black Hole Binary Systems: Challenges 1B.2.X 

Each of data sets IB. 2.1 and IB. 2. 2 contained a loud GW signal from a single MBH 
binary embedded in instrument noise. Gravitational radiation was modeled as the 



The Mock LISA Data Challenges: from Challenge IB to Challenge 3 



7 



$ 1.5x10-" 



0.5X1 0- 20 - 



f 

- - .-^J^^**^^^ 


MLDC data 

Residual 


Residual (rec 
f 


MLDC data 

omputed F-statistic) 

\ 



3.005 3.010 
frequency {mHz) 



Figure 2. Combined A and E spectral amplitude for the GW signals in data set 
IB. 1.4, before and after subtracting the AEI signal model. In the bottom panel, 
the extrinsic parameters for all binaries in the model were adjusted by maximizing 
the Jf-statistic. 



restricted waveform for spinless point masses moving on an adabatic sequence of 
circular orbits, evolving according to 2PN energy-balance equations [U sec. 4.4]. (See 
also section [3?2l for the new waveform features being introduced for Challenge 3.) Nine 
parameters are needed to describe each source: the two masses mi and ma, the time of 
coalescence t c , the sky-position angles (3 and A, the luminosity distance Dl, the orbital- 
inclination and GW-polarization angles i and ip, and the initial orbital phase ipo- The 
binaries in both data sets had masses drawn from the same ranges (mi = 1-5 x 10 6 Mq , 
mi/m.2 = 1-4), but were distinguished by the times of coalescence t c — 6 ± 1 months 
for Challenge IB. 2.1 and 400 ± 40 days (past the end of the data set) for Challenge 
IB. 2. 2; the SNR opt for the sources were chosen to be ~ 500 and ~ 80, respectively. 

Two groups submitted entries: JPL (a collaboration between researchers at 
Caltech and at the Jet Propulsion Laboratory) employed a three-step hierarchical 
strategy combining a time-frequency track-search analysis, a template-bank matched- 
filtering search, and a final MCMC stage to evaluate the posterior probability densities 
of source parameters for data sets IB. 2.1 and IB. 2. 2. Cardiff (a collaboration based 
at that university) used a stochastic-template-bank matched-filtering search |12j to 
analyze data set IB. 2.1. 

Table 2] summarizes these entries. While the recovered SNRs are very close 
to SNR pt, which indicates detections of very high confidence, there are large 
discrepancies in the sky-position angles for Challenge IB. 2.1, where both the JPL 
and Cardiff searches converged on secondary likelihood maxima, very close in height 
to the true mode (as shown by the recovered SNR), but quite distant in parameter 
space. In fact, the JPL result places the source almost at the antipodal sky position, 
even if SNR ~ SNR opt to better than one part in a thousand. 

This is a true global degeneracy, which does not appear in local Fisher-matrix 
analyses (another example of why mock-data endeavors are useful!), and which may 
indicate the need, in EM searches of counterparts to LISA binary-MBH detections, to 
examine unconnected regions of the sky. It may however be premature to make such 
an inference, since the degeneracy could be broken by spin effects (now included in 
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Table 4. Relative/absolute errors for the MBH binaries in Challenges IB. 2.1 
and IB. 2. 2. All angles are given in radians. Error estimates have been adjusted 
to account for two perfect symmetries of the waveforms with respect to source 
parameters: if) — * ift + T) and the simultaneous transformation if> — ^ ip + 7r/2, 
<P0 —> <P0 - t/2. 





1B.2.1 (SNR opt 


= 531.84) 


1B.2.2 (SNR opt 


= 80.67) 


Group 


JPL 


Cardiff 


JPL 




SNR 


531.57 


511.78 


79.86 




Arai/mi 


5.991 x 1(T 3 


0.108 


0.122 




Am2/m2 


-5.252 x 10~ 3 


-0.111 


-0.134 




At c [s] 


206.1 


-541.8 


-2688.2 




&D L /D L 


-0.139 


-1.438 


4.781 x 10' 


-3 


A/3 


2.429 


1.374 


5.862 x 10 


-3 


AA 


3.133 


0.548 


-1.461 x 10 


-2 


At 


0.713 


0.678 


-6.955 x 10' 


-2 


At/> 


-0.564 


1.448 


-4.878 x 10 


-2 


A^o 


-2.846 


-2.389 


1.293 x 10' 


-2 



Challenge-3 waveforms) and higher waveform harmonics. In addition, EM-counterpart 
searches would require a data-analysis system capable of determining the sky position 
of MBH binaries a few days in advance of their merger (corresponding to the interval 
between data dumps from LISA to the ground), whereas data set IB. 2.1 included 
the inspiral waveform all the way to the approximate merger frequency. Therefore 
this challenge was not aimed directly at establishing the feasibility of sky-position 
determination for EM-counterpart searches. In this context, it is however worth 
pointing out that the entries provided very accurate determinations of the times of 
coalescence, corresponding to time windows of a few minutes (for Challenge IB. 2.1) 
and about 45 minutes (for IB. 2. 2). 

If we can compare the errors of Table 4 with the parameter-determination 
accuracies predicted in the Fisher-matrix formalism, we see that the JPL estimates 
for mi and mi fall within the 2-er contour for data set IB. 2.1 (notwithstanding the 
problem with sky position), and near the 1-a contour for IB. 2. 2. For data set IB. 2. 2, 
JPL's At c , ADl, A/3, and Aipo, are also close to 1-a, and AA is ~ 2.2ct; the errors 
in l and tp are all tens of as (but the Fisher-matrix formalism is not always reliable 
for extrinsic parameters). Altogether, we conclude that the JPL search essentially 
achieves the theoretical limits of parameter extraction for data set IB. 2. 2, while it 
does so for an important subset of parameters for data set IB. 2.1. 

2.3. Extreme Mass Ratio Inspirals: Challenges 1B.3.X 

Although Challenge 2 saw a few successful detections of EMRI signals 5J, data sets 
IB. 3. 1-5 represented the first real testbed for the search algorithms developed for 
this critical source class. Each data set contained a GW signal from a single EMRI 
embedded in instrument noise, with SNR opt between ~ 55 and ~ 135, and source 
parameters chosen randomly as described in [4j. Fourteen parameters are needed to 
describe each EMRI source [4]: the ecliptic latitude and longitude j3 and A and the 
luminosity distance D^; the central-BH and compact-object masses M and /x; the 
magnitude a and orientation angles 6k, 4>k of the central-BH spin; the initial radial 
orbital frequency Uq and eccentricity eo; and three angles 70, «o & n d A describing the 
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initial orientation of the orbit. 

Entries were received from three groups: BBGP (a collaboration of scientists 
at the AEI, Cambridge, and the University of Southampton); EtfAG (AEI, 
Northwestern, and Cambridge); and MT (Montana State University). EtfAG 
employed a time-frequency technique [15] . whereas BBGP [14] and MT [15] 
developed coherent approaches based on Monte Carlo techniques, differing in their 
implementation. The entries were assessed as discussed at the beginning of this 
section, although the EtfAG time-frequency analysis cannot determine the extrinsic 
parameters, so their recovered SNR and correlation could not be computed. Tables [5] 
and [6] summarize all results. 

Both the time-frequency and coherent approaches succeeded in detecting these 
relatively strong EMRI signals and in constraining their parameters (with especially 
remarkable accuracy for data set 1B.3.1), although not all groups analyzed all data 
sets, and the performance of the same search pipeline varied across them. Challenge 
participants report that in some cases this was due to a lack of time for extended 
computations before the challenge deadline, so some parameter set were submitted as 
"best fits" although they were clearly understood to be secondary likelihood maxima. 
Thus, this early development work indicates that the main challenge for (isolated) 
EMRI analyses is the very complex structure of the likelihood surface in source- 
parameter space, which features a number of secondary maxima of similar height, 
even more so than for MBH binaries. 

We caution the reader that it would be inappropriate at this time to draw general 
conclusions about the relative merits of search methods and about the expected science 
payoff of LISA EMRI astronomy: it is not known how these techniques scale as the 
SNR decreases and in situations where EMRI signals overlap with each other and are 
affected by Galactic confusion noise. The first two complications will be addressed in 
Challenge 3. 

3. Synopsis of Challenge 3 

The third round of the MLDCs consists of five challenges (3.1-3.5). Data sets 3.1-3 
consist of approximately two years of data (2 22 samples at a cadence of 15 s) for time- 
delay interferometry (TDI) observables X, Y, and Z. These data sets are released 
both as time series of equivalent strain generated by the LISA Simulator [TH] and 
as time series of fractional frequency fluctuations generated by Synthetic LISA [TI5] ; 
see [3] p. S556] for the conversion between the two. Indeed (with a few exceptions, 
described below, for 3.4 and 3.5), the Challenge-3 data sets are built using the "pseudo- 
LISA" model of Challenges 1 and 2: the orbits of the LISA spacecraft are e 2 -accurate 
Keplerian ellipses with conventional orientations and time offsets; modified TDI (a.k.a. 
TDI 1.5) expressions are used for the observables; and Gaussian, stationary instrument 
noise is included from six proof masses and six optical benches with known noise levels 
that arc identical across each set of six | See g] for details. 

X The six proof-mass noises are uncorrelated and white in acceleration, with one-sided power spectral 
density (PSD) 

St£{f) = 3 x 10- 15 [1 + (10- 4 Hz//) 2 ] 1 / 2 ms- 2 Hz" 1 / 2 ; 
the six optical-path noises are uncorrelated and white in phase with PSD 
5 o 1 / 2 (/) = 20xl0- 12 mHz- 1 / 2 ; 
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Table 5. Overlaps and recovered SNRs for TDI observables A, E and combined 
recovered SNR for data sets IB. 3. 1-5. 



Group 


C A SNR a C e i 


3NR B 


total SNR 




IB. 3.1 (SNPv p t = 123.7) 






BBGP 


0.57 51.0 0.58 


51.6 


72.5 


MT 


0.998 86.1 0.997 


88.3 


123.4 




IB. 3. 2 (SNR p t = 133.5) 






BBGP 


0.07 6.6 0.18 


18.2 


17.6 


BBGP a 


0.39 37.6 0.41 


39.8 


54.7 


MT 


0.54 49.5 0.54 


50.8 


70.9 




IB. 3. 3 (SNR opt = 81.0) 






BBGP 


-0.06 -4.2 -0.0003 


-0.05 


-3.0 


BBGP a,c 


-0.2 -11.5 -0.32 


-19.0 


-21.5 


MT 


0.38 22.0 0.35 


20.9 


30.4 




IB. 3. 4 (SNR opt = 104.5) 






BBGP C 


0.0007 2.1 -0.0002 


-0.8 


2.1 


BBGP b 


0.16 13.9 0.04 


6.7 


14.6 




IB. 3. 5 (SNR p t = 57.6) 






BBGP 


0.09 3.4 0.1 


4.2 


5.3 



a C and SNR after correcting the sign of /3, lost on input to the MLDC webform. 
b C and SNR after correcting phases at t = 0, to account for a BBGP bug. 
c The BBGP SNRs can be negative because BBGP maximized likelihood 
analytically over amplitude, which makes SNR sign-insensitive (a minus sign 
corresponds to a change of it in the phase of the dominant harmonic). This 
degeneracy is broken when all the harmonics are found correctly. 

• Data set 3.1 contains a Galactic GW foreground from ~ 60 million compact 
binary systems. This data set is a direct descendant of Challenge 2.1, but it 
improves on the realism of the latter by including both detached and interacting 
binaries with intrinsic frequency drifts (either positive or negative) . Section 13.11 
gives details about the binary waveform models, about their implementation in 
the LISAtools suite [16] . and about the generation of the Galactic population. 

• Data set 3.2 contains GW signals from 4-6 binaries of spinning MBHs, on top of 
a confusion Galactic-binary background. This data set improves on the realism 
of Challenges 1.2.1-2 and 2.2 by modeling the orbital precession (and ensuing 
GW modulations) due to spin-orbit and spin-spin interactions. Section I3~2l gives 
details about the MBH-binary waveforms. 

Because this challenge focuses on the effects of spins rather than on the joint 
search for MBH signals and for the brightest Galactic binaries, the background 
is already partially subtracted — it is generated from the population of detached 
binaries used for Challenge 3.1, withholding all signals with SNR > 5. 

• Data set 3.3 contains five GW signals from EMRIs. As in Challenges 1.3.1- 

the conversion to Synthetic LISA'S dimensionless fractional frequency fluctuations is described on 
1 191 p. 6]; the values actually used in the MLDCs are 

S acc (/) = 2.5 X 10- 48 (//Hz)- 2 [l + (1(T 4 Hz//) 2 ] Hz" 1 ; 
S op t(/) = 1.8 x 10- 37 (//Hz) 2 Hz- 1 . 
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Table 6. Errors for a subset of EMRI parameters in Challenges IB. 3. 1-5. 
"Ax/fx]" denote fractional errors relative to the physical or prior range of the 
parameter. Note that due to the narrow prior range the relative error could be 
limited to ~ 10%, therefore we give the number which would show by how much we 
have decreased the prior range. Large errors correspond to the secondary maxima 
in the likelihood, the only true parameters (global maximum) were found by MT 
in Challenge 1.3.1. 



Group 


A/3 


A A 


AB K 


A< ^K Aa A^ 


AM 


A^Q 




A»Q 




TTT 

[: 'I 




175 , - I 


[Al j 


"0 
















Challenge IB. 3.1 












BBGP 


—0.03 


— u.uuoy 


— 0. 14 


n no n qi n ort 
U.Uoo 0.31 —0.20 


— 0.84 


0.026 




0.37 


— 0.022 


EtfAG 


0.019 


-0.0045 


0.56 


0.33 0.16 -0.11 


-0.27 


-9.3 x 10" 


-5 


0.17 


0.078 


MT 


0.0058 


0.0027 


4.4 x 10" 


" 4 0.0051 -0.0022 0.0065 


0.014 


3.2 x 10" 


-6 


-0.0085 


-0.0020 










Challenge IB. 3. 2 












BBGP 


-0.16 


-0.43 


0.46 


-0.33 -0.0088 -0.0040 


0.016 


1.4 x 10" 


-4 


-0.010 


-0.0013 


EtfAG 


-0.014 


0.0042 


0.97 


-0.36 0.0043 -0.046 


-0.069 


-6.5 x 10" 




0.041 


0.0041 


MT 


0.0040 


-0.0086 


0.79 


0.41 0.093 -0.064 


0.35 


-0.035 




0.068 


0.092 










Challenge IB. 3. 3 












BBGP 


0.091 


0.50 


-0.23 


0.045 -0.32 -0.49 


-0.029 


6.1 x 10" 


-4 


0.019 


0.054 


EtfAG 


-0.01 


-0.004 


0.49 


-0.34 0.0073 -0.059 


-0.061 


-7.8 x 10" 




0.038 


0.0061 


MT 


0.045 


-0.019 


-0.1 


0.077 -0.066 0.13 


0.59 


3.6 x 10" 


-4 


-0.33 


0.010 










Challenge IB. 3. 4 












BBGP 


-0.57 


-0.37 


0.37 


-0.31 -0.025 0.020 


-0.88 


0.066 




0.065 


-0.16 


EtfAG 


-0.56 


0.49 


0.56 


-0.34 0.059 0.12 


0.04 


2.8 x 10" 


-4 


-0.039 


0.0040 










Challenge IB. 3. 5 












BBGP 


-0.48 


-0.14 


-0.35 


0.1 -0.094 -0.094 


0.55 


-0.0021 




-0.017 


-0.060 


EtfAG 


-0.58 


0.46 


0.27 


-0.084 0.20 -0.7 


0.83 


-0.066 




0.066 


0.27 



5, EMRI waveforms are modeled with Barack and Cutler's "analytic kludge" 
waveforms [T7]; this challenge introduces the complication of detecting five such 
signals with lower SNRs, and in the same data set. By contrast, Galactic confusion 
is not included. See section [3751 for details. 

Challenges 3.4 and 3.5 address the detection of two GW sources that are new to the 
MLDCs, and that have (respectively) bursty and stochastic characters: thus, these 
searches require an accurate characterization of instrument noise, which in reality 
will not be available a priori, but will be obtained from the LISA measurements 
themselves. To model this problem, in data sets 3.4 and 3.5 the levels of the six + 
six secondary noises have been independently randomized by ±20%; the noises are 
however still uncorrelated. In addition, these data sets contain time series for all 
twelve "raw" LISA phase measurements yijk and [15], so that contestants may 
now build additional TDI observables to help characterize instrument noise. The phase 
measurements do include laser phase noise, because otherwise they would convey extra 
information unavailable from the real LISA; but laser noise is reduced in level to ~ 
ten times the secondary noise at 1 mHz, so that it can be canceled relatively easily 
with TDI 1.5 implemented with moderate timing precision. To wit: 

• Data set 3.4 consists of 2 21 samples at a cadence of 1 s (~ 24 days), and it 
contains GW burst signals from cosmic string cusps, occurring as a Poissonian 
random process throughout the data set, with a mean of five events. Details 
about the waveforms are given in section 13.41 The data set is provided only as 
fractional frequency fluctuations generated by Synthetic LISA. 
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• Data set 3.5 consists of 2 20 samples at a cadence of 2 s (again ~ 24 days), and it 
contains a stochastic GW background, which is isotropic, unpolarized, Gaussian 
and stationary; its spectrum grows at low frequencies as l// 3 , and its magnitude 
is set to a few times the secondary noise over a broad range of frequencies. Details 
about the synthesis of the background and the simpler model of the LISA orbits 
used for this challenge are given in section 13.51 The data set is provided as 
fractional frequency fluctuations generated by Synthetic LISA and by the new 
simulator LISACode [20], recently integrated into the LISAtools suite [IB] : thus, 
cross checks are possible between the two simulators. 

LISACode [20] was developed at APC-Paris with the purpose of accurately mapping 
the impact of the different LISA subsystems on its science observations, and of bridging 
the gap between the basic principles of the LISA measurement and a future, more 
sophisticated end-to-end simulator. Thus, LISACode includes realistic representations 
of most of the ingredients that will influence LISA's sensitivity (such as orbits, 
instrument noise, ultra-stable-oscillator time stamps, phasemeter response functions), 
internal waveform generators for several kinds of sources (monochromatic and chirping 
binaries, stochastic backgrounds, etc.), as well as the the possibility to build various 
TDI combinations. Many user-defined parameters make it possible to study the impact 
of different LISA configurations on its sensitivity. LISACode's conventions follow 
closely those of the MLDCs and of Synthetic LISA. 

All the Challenge-3 data sets can be downloaded at astrogravs . nasa.gov/docs/mldc/round3/datasets .html 
encoded in lisaXML [2j , an XML-based format that can be displayed directly in mod- 
ern web browsers, and handled easily in C/C++, Python, and MATLAB with the 
LISAtools I/O libraries [TBJ- Each data set is released in the blind challenge version 
and in a training version that includes the source parameters used to generate it. 
Additional training data sets can be generated easily with the LISAtools suited 

The remainder of this section describes the GW signal models adopted for each 
data set. See [4] for the conversion of the GW polarizations in source frame (given 
here) to the LISA frame. Table[7]is a glossary of source parameters with their symbols 
and lisaXML descriptors, while table [5] is a summary of the GW content of each data 
set along with the ranges used to choose source parameters randomly. 

3.1. Chirping Galactic binaries 

Data set 3.1 contains GWs from a population of ~ 26 x 10 6 detached and ~ 34 x 10 6 
interacting Galactic binaries. Each binary is modelled as a system of two point masses 
mi and m% in circular orbit with linearly increasing or decreasing frequency (depending 
on whether gravitational radiation or equilibrium mass transfer is dominant). The 
polarization amplitudes at the Solar-system barycenter, expressed in the source frame, 
are given by 




where the amplitude is derived from the physical parameters of the source as A = 
(2/j,/ 'Dl)(ttM /) 2//3 , with M = mi + m 2 the total mass, (i = m\milM the reduced 



§ After installing LISAtools following the instructions at code . google . com/p/lisatools/wiki/Install 

generating a training set is as simple as running (say, for Challenge 3.1) 

MLDCpipelines2/bin/challenge3.py -T -R 3.1 
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Table 7. Source paramet 


ers in Challenge 3. 






Symbol 


Standard parameter name 


fttanrlarrl unit 






(lisaXML descriptor) 


(lisaXML descr.) 




Common parameters 




Ecliptic latitude 





EclipticLatitude 


Radian 


Ecliptic longitude 


A 


EclipticLongitude 


Radian 


Polarization angle 




Polarization 


Radian 


Inclination 




Inclination 


Radian 


Luminosity distance a 


D L 


Distance 


Parsec 




Galactic binaries 




Amplitude 13 


A 


Amplitude 


1 (GW strain) 


Frequency 


f 


Frequency 


Hertz 


Frequency derivative 


f 


FrequencyDerivative 


Hertz/Second 


Initial GW phase 




InitialPhase 


Radian 


Spinning massive black-hole binaries 




Masses of component MBHs 


mi, 7712 


Massl, Mass2 


SolarMass 


Magnitude of spins Si, 52 


01, ai 


Spinl, Spin2 


MassSquared 


Initial orientation of spin Si 


9si,4>si 


Polar AngleOf Spinl 


Radian 






AzimuthalAngleOf Spinl 


Radian 


Initial orientation of spin S2 


QS2,4>S2 


. . . likewise 




Time to coalescence 


T c 


CoalescenceTime 


Second 


Phase at coalescence 




Phase At Coalescence 


Radian 


Initial orientation 


Ol, 4>l 


InitialPolarAngleL 


Radian 


of orbital momentum 




Init ialAzimuthalAngleL 


Radian 




EMRIs: 


see table 5 of 






Cosmic string cusp bursts 




Amplitude 13 (Fourier) 


A 


Amplitude 


Hertz" (1/3) 


Central time of arrival 


to 


CentralTime 


Second 


Maximum frequency 13 


/max 


MaximumFrequency 


Hertz 




Isotropic stochastic background 




PSD b ' d at 1 Hz 


s h 


PowerSpectralDensity 


(f/Hz)~-3/Hz 



a We do not deal explicitly with the redshifting of sources at cosmological distances, so Dl 
is a luminosity distance, and all masses and frequencies are measured at the Solar-system 
barycenter and red/blue-shifted by factors (1 + z)* 1 with respect to those measured locally 
near the sources. 

b Replaces Dl for Galactic binaries, cosmic-string-cusp bursts, and stochastic-background 
pseudosources. 

c Effectively replaces l for cosmic-string-cusp bursts. 

d Note also that S h = 5j; ot /384; ip is set to 0, and 1 not used. 
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Table 8. Summary of data-set content and source-parameter selection in 
Challenge 3. Parameters are sampled randomly from uniform distributions 
across the ranges given below, and all angular parameters (including spin and 
orbital-angular-momentum directions for MBH binaries) are drawn randomly 
from uniform distributions over the entire appropriate ranges. Source distances 
are set from individual-source SNRs, which are drawn randomly from the ranges 
specified below (in Challenge 3, "SNR" refers to the multiple-TDI-observable 
SNR approximated as y/2 X max{SNR x , SNRy, SNR Z }). The MBH time of 
coalescence t c and the cosmic-string— cusp burst central time tc are given relative 
to the beginning of the relevant data sets. 



Data set 


Sources 


Parameters 


3.1 


Galactic-binary background 


randomized population (see section l3~Tj) 






~ 34 x 10 6 interacting, ~ 26 x 10 6 detached 




plus 20 verification binaries 


known parameters (see section l3~Tj) 


3.2 


4-6 MBH binaries 


for each: mi = 1-5 x 10 6 M©, mi/m2 = 1-4, 






ai/mi = 0-1, aijmi = 0-1 




. . . including 


MBHi: t c = 90 ± 30 days, SNR ~ 2000 






mbh 2 : t c = 765 ± 15 days, SNR ~ 20 




. . . and 2-4 chosen from 


MBH 3 : t c = 450 ± 270 days, SNR ~ 1000 






mbh 4 : t c = 450 ± 270 days, SNR ~ 200 






MBH 5 : t c = 540 ± 45 days, SNR ~ 100 






MBH 6 : t c = 825 ± 15 days, SNR ~ 10 




plus Galactic confusion 


randomized population with approx. SNR < 5 






~ 26 x 10 6 binaries; no verification 


3.3 


5 EMRIs 


for each: fj, = 9.5-10.5 M , 5" = 0.5-0.7 M 2 , 






time at plunge = 2 21 -2 22 x 15 s, 






ecc. at plunge = 0.15-0.25, SNR = 10-50 




. . . including 


EMRli: M = 0.95-1.05 x 10 7 M o 






emri 2 and emri 3 : M = 4.75-5.25 x 1O 6 M 






EMRI4 and EMRI5: M = 0.95-1.05 x 1O 6 M 



3.4 n Cosmic-string-cusp bursts (with n Poisson-distributed with mean 5) 

/max = 10" 3 " 1 Hz, t c = 0-2 21 s, SNR = 10-100 
all instrument noise levels randomized ±20% 



3.5 Isotropic stochastic background 2 x 192 incoherent h+ and h x sources over sky 

Sl ot = 0.7-1.3 x 10" 47 (//Hz)- 3 Hz- 1 
all instrument noise levels randomized ±20% 



mass, and Dl the distance; / is the (constant) frequency derivative, and 4>o is the 
phase at t = 0. 

Since it would be unfeasible to process millions of barycentric binary waveforms 
individually through the LISA simulators to compute the TDI-observable time 
series, we adopt a fast frequency-domain method [21] that rewrites the LISA phase 
measurements as the fast-slow decomposition 

VH (t) = C(t) cos(2^/ <) ± S(t) sm(2nfot); (3) 

the functions C(t) and S(f) describe slowly varying effects such as the rotation of the 
LISA arms, the Doppler shift induced by orbital motion, and the intrinsic frequency 
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Figure 3. Histogram of the density of Galactic binaries in the Nelemans catalogs, 
binned by log 10 / and log 10 |/|. 

evolution of the source. These "slow" terms can be sampled very sparsely and 
Fourier-transformed numerically while the "fast" sine and cosine terms can be Fourier- 
transformed analytically. The results are then convolved to produce the LISA phase 
measurements, and these are assembled into the desired TDI variables. This algorithm 
is three to four orders of magnitude faster than the time-domain LISA simulators, 
although it effectively approximates LISA as a rigidly rotating triangle with equal and 
constant armlengths. See [H] for full details, and directory MLDCwavef orms/Galaxy3 
in LISAtools for the source code. 

The starting point for each realization of data set 3.1 are two large catalogs pro- 
vided by Gijs Nelemans (files |MLDCwavef orms/Galaxy3/Data/AMCVn_ GWR_MLDC Tdat 
and dwd_GWR_MLDC.dat in the LISAtools installation), which contain the parameters 
of 26.1 x 10 6 detached and 34.2 x 10 6 interacting systems produced by the population 
synthesis codes described in [22] [23]. Figure [3] shows the distribution of the binaries in 
the catalogs over / and /. Recent work by Roelofs, Nelemans and Groot [24] suggests 
that the model in [23] overpredicts the number of (AM CVn) interacting systems by 
a factor of 5-10, but we did not implement this correction for Challenge 3. 

The parameters of each binary in the catalogs are modified by randomly tweaking 
/ by ±1%, A by ±10%, (3 and A by ±0.5°, and by randomly assigning ip, t, 
and tf>o {f is computed from the catalog's binary-period derivative and from the 
tweaked /). These random perturbations are large enough to render the original 
population files useless as answer keys, but small enough to preserve the overall 
parameter distributions. Binaries with approximate singlc-Michelson SNR > 10 are 
regarded as "bright" and listed in a separate table in the challenge keys. Data set 3.1 
includes also 20 verification binaries of known parameters (specified in LISAtools file 
MLDCwavef orms/Galaxy3/Data/Verif ication.dat as rows of /, /, (3, A, A). 

3.2. Spinning MBH binaries 

The spinning-MBH-binary GW signals of data set 3.2 are modeled as restricted 
waveforms (no higher harmonics) from 2PN circular adiabatic inspirals, with 
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uncoupled orbital frequency evolution and spin and orbital precession. Both the orbital 
phase and frequency are computed as explicit functions of time, corresponding to T3 
waveforms in the classification of [|'2 5 ; : 



Mcj = -T~ 3 / 8 1 



743 



2688 
/ 1855099 



11 

32 5 



11 \ -1/4 ° 
— ?? r ' 

' ' 10 



/3 

7T 

4 



-3/8 



56975 



371 



r'7 



where M = m,\ 
ratio, and 



V 14450688 258048 ' ' 2048' 
m 2 is the total mass, r\ — m\mijM' 1 



64 



-1/2 



(4) 



5M V ; ' 



/i/M is the symmetric mass 

(5) 



i=l,2 



Xii^N-Si) (I13^ + 75r, 



247(S 1 -S 2 )-721(L N -S 1 )(L J vS 2 ) 



(6) 



(7) 



Here Ljy, Si and S 2 are the unit vectors along the leading-order angular orbital 
momentum and the MBH spins. The intrinsic orbital phase is 



r 5/8 



3>orb = $C 



1 



/3715 55 
I 8064 + 96 



( 9275495 284875 



V 14450688 258048 



n - 



1855 
2048 



15 
64 ( 



-1/2 



(8) 



however, because the spin-orbit coupling causes the orbital angular momentum to 
precess around the total angular momentum, the phase that enters the gravitational 
waveforms contains an additional correction 1261: 



(Ljv • n)[Ljv x n] • t N _ 



5$, 



(9) 



1 - (Lat • n 

where n is direction to the source. The constant of integration in this equation can 
be redefined so that <5<& = 0at£ = 0. The equations of precession for Ljv, Si and S 2 
are given by (2.9)-(2.11) in [27]. In the source frame the gravitational polarizations 
are then given (with respect to a time-varying polarization basis) by 

, _ 2 M 



D 



(1 + cos 2 i)(Mw) 2/3 cos 2$, 



^ cos i{Mujf> ' 3 sin 2$, 



(10) 



where cosi = (L^ ■ n). The polarizations and hf. in a fixed polarization basis are 
obtained by way of a rotation by the instantaneous polarization angle 

sin (3 cos (X — 4>l) sin0L — cos8l cos [3 



cos p sin (A — (f>L) 
(where 9l and cf)^ define the direction of Ljv) yielding 

hi_ = — h + cos 2i/j — h x sin 2-0, 
h x = h + sin 2ip — h x cos 2-0. 



(11) 

(12) 
(13) 
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The end of the inspiral is handled with the exponential taper used also for the 
MBH-binary waveforms of Challenge 2 See directory MLDCwavef orms/FastBBH 
in LISAtools for the source code for these waveforms. 

Data set 3.2 includes also a Galactic confusion background generated from 
the same detached-binary population as used in Challenge 3.1 (interacting systems 
have typically very small chirp masses and are not expected to make a significant 
contribution), but withholding all binaries with individual A + E SNR > 5, relative to 
instrument noise plus an estimate of confusion noise, which was derived using a BIC 
criterion for the resolvability of individual Galactic binaries |21) : 

l(r 44 - 62 (//Hz)- 2 - 3 for/ e [l(T 4 ,l(r 3 ] Hz, 

_ 1fi 2-2 w -i J 10- 5a92 (//Hz)- 4 - 4 for/ e [10- 3 ,l(r 2 - 7 ] Hz, 
>x, g ai-iba; sin xtiz x <; 10 -62.8 (//Hz) -8.s for/ e [10 -2. 7) 10 -2.4] HZj U4j 

10 -89.68 (//Hz) -20.0 for/ e [ 10 -2.4 1Q -2.0] Hz 

(fractional frequency fluctuations, with x = 2?rfL, L ~ 16.6782 s). The resulting 
confusion background is consistent with (|14[) . which is also used in Challenge 3 (on 
top of instrument noise) to define the SNRs of GW signals from MBH binaries, EMRIs 
(Challenge 3.3) and cosmic-string cusps (Challenge 3.4). 



3.3. EMRIs 

The EMRI waveforms of data set 3.3 are the Barack-Cutler [17] "analytic kludges" 
used for Challenge 1.3.1-5 and described in [4] sec. 4.5], with the single change in that 
the number of eccentric-orbit harmonics included in the waveform does not evolve with 
eccentricity, but is fixed at five (lisaXML parameter FixHarmonics; a value of zero 
will reproduce the old behavior). See directory MLDCwavef orms/EMRI in LISAtools for 
the source code. 



3.4- Cosmic string cusps 

Data set 3.4 contains a number of bursts from cosmic strings, the first of two new 
GW sources introduced with Challenge 3. Cosmic strings are linear topological 
defects that may be formed in early Universe at the phase transitions predicted in 
many elementary-particle and supcrstring models. Cosmic-string oscillations emit 
gravitational radiation, with a substantial part of the emission from cusps, which can 
achieve very large Lorentz boosts |28j . In the limit where the tip of a cusp is moving 
directly toward the observer, the observed metric perturbation is a linearly polarized 
GW with [29] 

GuL 2 / 3 

h(t) = A\t — tcl 1 ^ 3 x (incomplete T function envelope), A ~ ; (15) 

Dl 

here tc is the burst's central time of arrival, G is Newton's constant, /i is the string's 
mass per unit length, Dl is the luminosity distance to the source, and L is the size 
of the feature that produces the cusp (e.g., the length of a cosmic string loop). If 
the observer's line of sight does not coincide with the cusp's direction of motion, 
the waveform becomes a much more complicated mixture of polarizations |30| . If 
the viewing angle a departs only slightly from zero (which is our assumption), the 
waveform remains dominantly linearly polarized, and the sharp spike in (|15[) is rounded 
off, introducing an exponential suppression of Fourier-domain power for frequencies 
above / max = 2/(a 3 L). 
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Following the model used by the LIGO Science Collaboration, we define our cusp 
waveforms in the Fourier domain according to 

|M/)l = 4T 4/3 (i + (/iow//)T 4 . ^=o, (16) 

with exp(l — ///max) suppression above / max - The amplitude A has dimensions Hz 1 / 3 ; 
/low sets the low-frequency cutoff of what is effectively a fourth-order Butterworth 
filter, which prevents dynamic-range issues with the inverse Fourier transforms (for 
Challenge 3 we set /i ow = 1 X 1CP 5 Hz). The phase of the waveform is set to 
exp i(7r — lixftc) before inverse- Fourier transforming to the time domain. See directory 
MLDCwavef orms/CosmicStringCusp in LISAtools for the source code. 



3.5. Stochastic background 

Data set 3.5 contains the second GW source new to Challenge 3: an isotropic, 
unpolarized, Gaussian and stationary stochastic background. Allen and Romano 
[3Tj define a stochastic background as the "gravitational radiation produced by an 
extremely large number of weak, independent, and unresolved gravity- wave sources, 
[...] stochastic in the sense that it can be characterized only statistically." Such 
backgrounds are usually characterized by the dimensionless quantity 

n 8W (/) = — p^- f , (17) 

p crit d log / 

with p gw the energy density in GWs, and p CI n = 3c 2 Hq/(8itG) the closure energy 
density of the Universe, and they are idealized as the collective, incoherent radiation 
of uncorrelated infinitesimal emitters distributed across the sky. If the background 
is isotropic, unpolarized, Gaussian and stationary, the Fourier amplitude h A {f, of 
each emitter (with A indexing the + and x polarizations, and Cl the direction on the 
two-sphere) is completely characterized by the power-spectral-density relation [31) 

(h* A (f,n)h A ,(f,ft)) = ^l/r^d/i) x <w(/ - f)s 2 (Ci,Ci'). (is) 

In Challenge 3, we assume a constant ^(J), as appropriate for the primordial 
background predicted in simple cosmological scenarios. We implement the 
uncorrelated emitters as a collection of 192 pseudosources distributed at HEALPix 
pixel centers across the sky. HEALPix (the Hierarchical Equal Area isoLatitudc 
Pixelization of spherical surfaces [32]) is often used to represent cosmic microwave 
background data sets; 192 pixels correspond to a twice-refined HEALPix grid with 
A sido = 2 2 . 

Each pseudosource consists of uncorrelated pseudorandom processes for h-\- and 
h x , generated as white noise in the time domain, and filtered to achieve the /~ 3 
spectrum of (|18p . using the the recursive l// 2 filtering algorithm proposed by 
Plaszczynski [53], extended to spectral slope —3. The algorithm employs a chain 
of l// 2 infinite-impulse-response filters to reshape the white noise spectrum between 
minimum and maximum frequencies /i ow and /knoci set to 10~ 5 and 10~ 2 Hz in this 



Challenge (see the source file MLDCwavef orms/Stochastic .py in LISAtools for the 
Synthetic LISA implementation). 

The one-sided PSD of each single-polarization random process (which represents 
the finite area of a pixel in the sky) is then given by Sh(f)/2 = 3i/ 2 /(327r 3 )/ _3 fi gw x 
(47r/192). In data set 3.5, we define Sj" 1 — (192 x 2)Sh and we set it so that, in the 
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TDI observables, the GW background is a few times stronger than LISA's secondary 
instrument noise. Namely, 

Sl ot (f) = 0.7-1.3 x 10" 47 (//Hz)- 3 Hz" 1 (19) 

(taking H = 70km/s/Mpc, this corresponds to ft gw = 8.95 x 10 _12 -1.66 x lCT 11 ). 

One of the more promising approaches to detect GW backgrounds with LISA 
relies on estimating instrument noise levels by way of symmetrized TDI observables 
that are insensitive to GWs at low frequencies in the LISA band [3H [3S1 [35] ■ For 
realistic LISA orbits, however, the low-frequency behavior of such observables becomes 
more complicated than discussed in the literature. To simplify the initial investigation 
of the background-detection problem in data set 3.5, we have therefore approximated 
LISA as a rigidly rotating triangle with equal and constant armlengths (i.e., Synthetic 
LISA's [CircularRotating| ). 

4. Conclusion 

Since their inception, the Mock LISA Data Challenges have received remarkable 
support from the GW community, and have set the stage for many practical 
demonstrations of the feasibility of LISA's exciting science with present-day data- 
analysis techniques. Future challenges will feature ever more realistic models of 
waveforms and instrument noise, and they will endeavour to scope out all important 
aspects of the LISA science objectives. In addition, the software tools developed 
for the MLDCs [IH] can be used to generate data sets for many other data- 
analysis experiments outside the main challenges; and the MLDC standard model 
of LISA's observations (including the MLDC "pseudo-LISA" and GW models) is 
proving extremely valuable to the current analytical investigations of the LISA science 
performance that are being run by the LIST. 

To obtain more information and to participate in the MLDCs, see the 
official MLDC website (astrogravs.nasa.gov/docs/mldc) and the Task Force wiki 
( www . tapir . caltech . edu/listwglb| . 

Acknowledgments 

SB and EP were supported by the Deutsches Zentrum fur Luft- und Raumfahrt; LWs 
work by the Alexander von Humboldt Foundation's Sofja Kovalevskaja Programme 
funded by the German Federal Ministry of Education and Research. JG thanks 
the Royal Society for support and the Albert Einstein Institute for hospitality and 
support while part of this work was being completed. SF was supported by the 
Royal Society, BSS and IH by the UK Science and Technology Facilities Council. 
MV was supported by the LISA Mission Science Office and by JPL's HRDF. CC's, 
JC's and MV's work was carried out at the Jet Propulsion Laboratory, California 
Institute of Technology, under contract with the National Aeronautics and Space 
Administration. MT acknowledges support from the Spanish Ministerio de Eduacion 
y Ciencia Research projects FPA-2007-60220, CSD207-00042 and the Govern de les 
Hies Balears, Conselleria d'Economia, Hisenda i Innovacio. IM was partially supported 
by NASA ATP Grant NNX07AH22G to Northwestern University. 



The Mock LISA Data Challenges: from Challenge IB to Challenge 3 



20 



References 

[1] Bender P, Danzmann P and the LISA Study Team 1998 "Laser Interferometer Space Antenna 
for the Detection of Gravitational Waves, Pre-Phase A Report" MPQ 233 (Garching: Max- 
Planck-Institiit fur Quantenoptik) 

[2] Arnaud K A et al. (the MLDC Task Force) 2006 Laser Interferometer Space Antenna: 6th 
International LISA Symp. (Greenbelt, MD, 19-23 Jun 2006) ed Merkowitz S M and Livas 
J C (Melville, NY: AIP) p 619; ibid p 625 (long version with lisaXML description: Preprint 
|gr^qc/0609106} 

[3] Arnaud K A et al. (the MLDC Task Force and Challenge 1 participants) 2007 Class. Quant. 
Grav. 24 S529 

[4] Arnaud K A et al. (the MLDC Task Force) 2007 Class. Quant. Grav. 24 S551 
[5] Babak S et al. (the MLDC Task Force and Challenge 2 Participants) 2008 Class. Quant. Grav. 
25 114037 

[6] Vallisneri M, Crowder J and Tinto M 200 8 Class. Quant. Grav. 25 065005 

[7] Xspec, an X-ray Spectral Fitting Package, heasarc.gsfc.nasa.gov/docs/xanadu/xspec 
[8] Jaranowski P, Krolak A and Schutz B F 1998 Phys. Rev. D 58 063001 

[9] Whelan J T, Prix R and Khurana D 2008 "Mock LISA Data Challenge IB: improved search 
for galactic white dwarf binaries using an F-statistic template bank" in this volume (Preprint 
larXiv:0805.1972)l 

[10] Trias M, Vecchio M and Veitch J 2008 "Markov Chain Monte Carlo searches for Galactic binaries 

in MLDC-1B data sets" in this volume ( Preprint rarXiv:0 804.4029 ) 
[11] Gijs Nelemans' LISA wiki, www.astro.kun.nl/~nelemans/dokuwiki 

[12] Harry I W, Fairhurst S and Sathyaprakash B S 2008 "A hierarchical search for gravitational waves 

from supermassive black hole binary mergers" in this volume (Preprint arXiv:0804.3274 ) 
[13] Gair J R, Mandel I and Wen L "Improved time-frequency analysis of extreme-mass-ratio inspiral 

signals in mock LISA data" in this volume (Preprint arXiv:0804.1084) 
[14] Gair J R, Babak S, Porter E K and Barack L 2008 "A Constrained Metropolis-Hastings Search 

for EMRIs in the Mock LISA Data Challenge IB" in this volume (Preprint larXiv:0804.3322)l 
[15] Cornish N J 2008 "Detection Strategies for Extreme Mass Ratio Inspirals" in this volume 

(Preprint larXiv:0804."3323l 
[16] LISAtools website and SVN repository, lisatools.googlecode.com 
[17] Barack L and Cutler C 2004 Phys. Rev. D 69 082005 

[18] Cornish N J and Rubbo L J 2003 Phy s. Rev. D 67 022001; erratum-ibid. 029905; LISA Simulator 

website, www.physics.montana.edu/lisa; now included in 1 161 
[19] Vallisneri M 2005 Phys. Rev. D 71; Synthetic LISA website, www.vallis.org/syntheticlisa; 

now included in 1161 

[20] Petiteau A, Auger G, Halloin H, Jeannin O, Plagnol E, Pireaux S, Regim- 
beau T and Vinet J-Y 2008 Phys. Rev. D 77 023002; LISACode website, 
www.apc.univ-paris7.fr/LISA-France/analyse.phtml; now included in |16| 

[21] Cornish N J and Littenberg T B 2007 Phys. Rev. D 76 083006 

[22] Nelemans G, Yungelson L R and Portegies Zwart S F 2001, Astron. Astrophys. 375 890 
[23] Nelemans G, Yungelson L R and Portegies Zwart S F 2004, Mon. Not. Roy. Astron. Soc. 349 
181 

[24] Roelofs G H A, Nelemans G and Groot P J 2007 Mon. Not. Roy. Astron. Soc. 382 685 

[25] Damour T, Iyer B and Sathyaprakash B S 2001 Phys. Rev. D 63 044023 

[26] Apostolatos T A, Cutler C, Sussman G J and Thorne K S 1994 Phys. Rev. D 49 6274 

[27] Lang R N and Hughes S A 2006 Phys. Rev. D 74 122001 

[28] Damour T and Vilenkin A 2000 Phys. Rev. Lett. 85 3761 

[29] Siemens X, Creighton J, Maor I, Ray Majumder S, Cannon K and Read J 2006 Phys. Rev. D 
73 105001 

[30] Siemens X and Olum K D 2003 Phys. Rev. D 68 085017 
[31] Allen B and Romano J D 1999 Phys. Rev. D 59 102001 

[32] Gorski K M et al. 2005 Astrophys. J. 622 759; HEALPix website, [healpix ■ j pi ■ nasa . gov | 
[33] Plaszczynski S 2007 Fluctuation Noise Lett. 7 Rl 

[34] Tinto M, Armstrong J W and Estabrook F B 2000 Phys. Rev. D 63 021101 
[35] Hogan, C J and Bender P L 2001 Phys. Rev. D 64 062002 

[36] Robinson E L, Romano J D and Vecchio A 2008 "Search for a stochastic gravitational-wave 
signal in the second round of the Mock LISA Data Challenges" in this volume (Preprint 
lajXiv:0804.4144l 



